arXiv:1507.07368vl [cs.DM] 27 Jul 2015 


Almost Optimal Cover-Free Families 


Nader H. Bshouty and Ariel Gabizon 

Department of Computer Science 
Technion, Haifa, 32000 


Abstract. Roughly speaking, an (n, (r, s))-Cover Free Family (CFF) is 
a small set of n-bit strings such that: “in any d := r+ s indices we see all 
patterns of weight r”. CFFs have been of interest for a long time both in 
discrete mathematics as part of block design theory, and in theoretical 
computer science where they have found a variety of applications, for 
example, in parametrized algorithms where they were introduced in the 
recent breakthrough work of Fomin, Lokshtanov and Saurabh |16| under 
the name ‘lopsided universal sets’. 

In this paper we give the first explicit construction of cover-free fam¬ 
ilies of optimal size up to lower order multiplicative terms, for any r 
and s. In fact, our construction time is almost linear in the size of the 
family. Before our work, such a result existed only for r = d°^^K and 
r = cu(d/(log log dlog log log d)). As a sample application, we improve 
the running times of parameterized algorithms from the recent work of 
Gabizon, Lokshtanov and Pilipczuk )18| . 


1 Introduction 

The purpose of this paper is to give an explicit almost optimal construction of 
cover free families |20) . Before giving a formal definition, let us describe the 
special case of group testing. The problem of group testing was first presented 
during World War II and described as follows [TUES]: Among n soldiers, at most 
s carry a fatal virus. We would like to blood test the soldiers to detect the 
infected ones. Testing each one separately will give n tests. To minimize the 
number of tests we can mix the blood of several soldiers and test the mixture. If 
the test comes negative then none of the tested soldiers are infected. If the test 
comes out positive, we know that at least one of them is infected. The problem 
is to come up with a small number of tests. 

To obtain a non-adaptive algorithm for this problem, a little thought shows 
that what is required is a set of tests such that for any subset T of s soldiers, and 
any soldier i ^ T, there is a test including soldier i, and precluding all soldiers 
in T. Let d = s -I- I. Viewing a test as a characteristic vector a € {0,1}" of the 
soldiers it includes, the desired property is equivalent to the following. Find a 
small set T C {0,1}" such that for every 1 < ii < ^2 < • • • < id < n, and every 
1 < J < d, there is a € such that = 1 and Oi^. = 0 for all k ^ j. 




1.1 Cover-Free Families 


We can view T described above as a set of strings such that “in any d indices 
we see all patterns of weight one”. We can generalize this property by choosing 
an integer 1 < r < d and requesting to see “in any d indices all patterns of 
weight r”. 

Definition 1 (Cover-Free Family). Fix positive integers r,s,n with r,s < n 
and let d := r + s. An (n, (r, s))-Cover Free Family (CFF) is a set T C {0,1}" 
such that for every 1 < < 12 < • • • < id < n and every J C [d] of size | J| = r 

there is a € F sueh that = 1 for j £ J and = 0 for k ^ J. 

We will always assume r < d/2 (and therefore r < s): If not, construct an 
(n, (s,r))-CFF and take the set of complement vectors. 

We note that the definition of CFFs usually given is a different equivalent 
one which we now describe. Given an (n, (r, s))-CFF F, denote N = |J^| and 
construct the N x n boolean matrix A whose rows are the elements of F. Now, 
let W be a set of N elements and think of the columns of A as characteristic 
vectors of subsets, which we will call blocks, B C X. That is, if we denote by 
B = {Bi, ..., Bn} the set of blocks corresponding to these columns, then A is the 
incidence matrix of B, i.e. the i’th element of X is in Bj if and only if Aij = 1. 

For this view, the CFF property of F implies the following: For any blocks 
Bi,... ,Br £ B and any other s blocks Ax,... ,As £ B (distinct from the B's), 
there is an element of X contained in all the B’s but not in any of the A’s, i.e. 

r s 

i=i i=i 

This property is the usual way to define CFFs [20] . 

Notation: Let us denote by N(n, (r, s)) the minimal integer N such that there 
exists an (n, (r, s))-CFF F of size | = N. 

1.2 Previous Results 

It is known that, |32) . 


where 


N{n, (r, s)) > Q{N(r, s) ■ logn) 

d{t) 


N{r, s) := 


d\ 


log(^) 


Using the union bound it is easy to show that for d = r + s = o(n), r < s, 
we have 

N{n, {r, s)) < O log ■ N{r, s) ■ logn^ . 
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D’yachkov et. al.’s breakthrough result, m, implies that for s, n —>■ c» 

-/V(n, (r, s)) = 6> (iV(r, s) • log n). (1) 

The two above bounds are non-constructive. 

It follows from m, that for an infinite sequence of integers n, an (n, (r, s))- 
CFF of size 

M = o((rd)i°«*"logn) 

can be constructed in polynomial time. 

Before proceeding to describe previous results and ours, we introduce some 
convenient terminology: 

We will think of the parameter d = r + s as going to infinity and always use 
the notation o(l) for a term that is independent of n, and goes to 0 as d i—> oo. 
We say an (n, (r, s))-CFF is almost optimal, if its size N = | satisfies 

{ d’’+i+°(i)logn if r = 0(1) 

(^)r+o(r-) log^ if r = a;(l),r = o(d) • 

2H2ir/d)d+o{d) iQg ^ ^ ^ 

where H 2 {x) is the binary intopy function. 

We say that such T can be constructed in linear time if it can be constructed 
in time 0{N{r, dogn-n). In this terminology, our goal is to obtain almost 

optimal CFFs that are constructible in linear time. 

Let us first consider the case of constant r. It is not hard to see that in this 
case an (n, (r, s))-CFF T of size d'^^^ logn is almost optimal by our definition 
(and in fact exceeds the optimal size in ([2]) only by a multiplicative logd factor). 
Bshouty [8] constructs of such size in linear time and thus solves the case of 
constant r. In fact, calculation shows that for any r = of size 

iV = 2*^^ • d''+i • logn 

is almost optimal. Bshouty m constructs such J- in linear time for any r = o{d). 
We proceed to the case of larger r. Fomin et. al. m construct an (n, (r, s))-CFF 
of size 


2^^ ) log n (2) 

in linear time. This is almost optimal when 

f d \ 

r = ui [ - 1 . 

\ log log d log log log d J 

To the best of our knowledge there is no explicit construction of almost optimal 
(n, (r, s))-CFFs when < r < w(d/(log log dlog log log d). 
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Note that in this range (and even for r = a;(l) and r = o{d)), T is almost 
optimal if and only if it has size 

iV = I I log n = I - 1 • fog n. 

Gabizon et. al |18j made a significant step for general r and constructed an 
(n, (r, s))-CFF of size 

0((d/r)2'’'.2'^M-logn) 

in linear time. This is quadratically larger than optimal. 

1.3 Nevif Result 

As mentioned before, there is no explicit construction of almost optimal (n, (r, s))- 
CFFs when < r < a;(d/(loglog(ilogloglog(i) and the result of [TH] is 

quadratically larger than optimal. In this paper we close this quadratic gap and 
give an explicit construction of an almost optimal (n, (r, s))-CFF for all r and s. 
Our main result is 

Theorem 1. Fix any integers r < s < d with d = r + s. There is an almost 
optimal (n,{r, s))-CFF, i.e., of size 

-logn, 

that can be constructed in linear time. That is, in time 

0{N{r, • n • logn) 

As we’ve seen in Section [TT^ the above theorem is already proved for r < d°^^^ 
and r > w (d/(log log d log log log d)). 

2 Applications of result 

2.1 Application to learning hypergraphs 

Let be a set of all labeled hypergraphs of rank at most r (the maximum 
size of an edge e C F in the hypergraph) on the set of vertices V = {1,2,... ,n} 
with at most s edges. Given a hidden Sperner hypergrapf0 G € Gs,r, we need 
to identify it by asking edge-detecting queries. An edge-detecting query Qc{S), 
for S' C F is: Does S contain at least one edge of G? Our objective is to non- 
adaptively learn the hypergraph G by asking as few queries as possible. 

This problem has many applications in chemical reactions, molecular biology 
and genome sequencing, where deterministic non-adaptive algorithms are most 
desirable. In chemical reactions, we are given a set of chemicals, some of which 

^ The hypergraph is Sperner hypergraph if no edge is a subset of another. If it is not 
Sperner hypergraph then learning is not possible. 
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react and some which do not. When multiple chemicals are combined in one test 
tube, a reaction is detectable if and only if at least one set of the chemicals in 
the tube reacts. The goal is to identify which sets react using as few experiments 
as possible. The time needed to compute which experiments to do is a secondary 
consideration, though it is polynomial for the algorithms we present. See [3] 
and references within for more details and many other applications in molecular 
biology. 

The above hypergraph ^ learning problem is equivalent to the problem of 
exact learning a monotone DNF with at most s monomials (monotone terms), 
where each monomial contains at most r variables (s-term r-MDNF) from mem¬ 
bership queries |1I4) . A membership query, for an assignment a G {0,1}" returns 
/(a) where / is the hidden s-term r-MDNF. 

The non-adaptive learnability of s-term r-MDNF was studied in |:-i3l24l2bll7lin!?] . 
All the algorithms are either deterministic algorithms that uses non-optimal 
constructions of (n, (s,r))-CFF or randomized algorithms that uses randomized 
constructions of (n, (s, r))-CFF. Our construction in this paper gives, for the 
deterministic algorithm, a better query complexity and changes the randomized 
algorithm to deterministic. Recently, our construction is used in [3] to give a 
polynomial time almost optimal algorithm for learning Gs,r- 

2.2 Application to r-Simple fc-Path 

Gabizon et. al. |18j recently constructed deterministic algorithms for parametrized 
problems with ‘relaxed disjointness constraints’. For example, rather than search¬ 
ing for a simple path of length k in a graph of n vertices, we can search for a path 
of length k where no vertex is visited more than r times, for some ‘relaxation pa¬ 
rameter’ r. We call the problem of deciding whether such a path exists r-SiMPLE 
/c-Path. Abasi et. al [2] were the first to study r-SiMPLE /c-Path and presented 
a randomized algorithm running in time What is perhaps surprising, 

is that the running time can significantly improve as r grows. Derandoming the 
result of 0 , m obtained a deterministic algorithm for r-SiMPLE fc-PATH with 
running time At the core of their derandomization is the no¬ 

tion of a ‘multiset separator’ - a small family of ‘witnesses’ for the fact that two 
multisets do not ‘intersect too much’ on any particular element. How small this 
family of witnesses can be in turn depends on how small an (n, {2k/r, k — 2k/r))- 
CFF one can construct (details on these connections are given in Appendix IbI). 
Plugging in our new construction into the machinery of |18j . we get 

Theorem 2. r-SiMPLE fc-PATH can be solved in deterministic time 

20(fc/r) . ^O(l) .^3 

For example, when both k/r and r tend to infinity, we get running time O* (r8fe/r+o(fc/r) ^ 
and [E] get 0 *(r-i 2 feA+o(fe/D). 

In a well-known work, Koutis m observed that practically all parametrized 
problems can be viewed as special cases of ‘multilinear monomial detection’. [18] 
also studied the relaxed version of this more general problem: Given an arith¬ 
metic circuit C computing an n-variate polynomial / G 'Z[Xi ,..., A„], determine 
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whether / contains a monomial of total degree k and individual degree at most r. 

We call this problem (r, A:)-Monomial Detection. [TS] define such a circuit C 
to be non-canceling if it contains only variables at its leaves (i.e., no constants), 
and only addition and multiplication gates (i.e., no substractions). [TH] showed 
that for non-canceling C, (r, fe)-MoNOMiAL Detection can be solved in time 
0*{\C\ ■ We obtain 

Theorem 3. Given a non-canceling arithmetic circuit C computing f G Z[Xi,..., X^], 
(r, /c)-Monomial Detection can be solved in deterministic time • 

20(fc/r) .^0(1) .^3 .logn). 


Organization of paper 

In SectionOwe give an informal description of our CFF construction. In Section^] 
we give a simple construction that proves Theorem [T] for any log^ d < r < 
^/(loglogd)'^^^^. In Section O we give the proof for d/< r < d/uj{l). 
The proofs of Theorems [5] and |3] appear in Appendix [BJ 

3 Proof Overview 

Our construction is essentially a generalization of [TH] allowing a more flexible 
choices of parameters. For simplicity, we first describe the construction of [18] 
and then explain our improvements. 

To illustrate the ideas in a simple way, the following ‘adaptive’ viewpoint will 
be convenient: We are given two disjoint subsets C, D C [n] of sizes \C\ = r and 
\D\ = s. We wish to divide [n] into two separate buckets such that all elements 
of C fall into the first, and all elements of D fall into the second. Of course the 
point in CFFs is that we do not know C and D in advance. However, the number 
of different possibilites for the division that will come up in the process will be 
a bound on the size of an analogous (n, (r, s))-CFF- which will contain a vector 
a G {0,1}” corresponding to each way of separating [n] into two buckets that 
came up in the adpative process. 

As a first step we use a perfect hash function h to divide [n] into r buckets 
such that each bucket contains exactly one element of C. Using a construction 
of Naor et. al [28], h can be chosen from a family of size 2'^*^’'^ • logn. Let us 
call these buckets Bi,... ,Br. Now, suppose that we knew, for each i G [r], the 
number of elements Si from D that fell into bucket Bi. In that case we could use 
an (n, (l,Si))-CFF Bi to separate the element of C in Bi from the Si elements 
of D, and put each in the correct final bucket. 

We have such of size c- sf • log n for universal constant c. Thus, the number 
of different choices in all buckets is 

r 

c • • logn < c’’ • (s/r)'^'" ■ log’' n, 

i=l 

as the product of the sds is maximized when si = ... Sr = s/r. Furthermore, 
[T5] show this can be improved to roughly {s/rY ■ logn < {d/rY ■ logn where 
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d = r + s. This is done using the hitting sets for combinatorial rectangles of 
Linial et. al [22] (we do not go into details on this stage here). Of course, we do 
not know the Si’s. However, it is not too costly to simply guess them! Or rather, 
try all options: The number of choices for non-negative integers si,..., such 
that Si -b ... -b Sr = s is at most 



Combining all stages, this gives us an (n, (r, s))-CFF of size roughly ■ 

logn. To get an almost optimal construction, we need to get the 2 in the expo¬ 
nent down to a 1. We achieve this by reducing the cost of the ‘guessing stage’. 
Instead of r buckets, we begin by dividing [n] into k buckets for some k = o(r), 
such that every bucket will contain r/k elements of C. This is done using split¬ 
ters [2H| ■ For concreteness, think oik = r/ log log d. (In the hnal construction we 
need to choose k more delicately). Now as we only have k Si’s, there will be less 
possibilites to go over such that si -b ... -b Sfe = s - specifically less than (ed/k)^. 

On the other hand, our task in each bucket is now more costly - we need to 
separate r/k elements of C from Si elements of D, rather than just one element 
of C. A careful choice of parameters show this process can be done while going 
over at most ((i/r)^+°F) options for the partition into two buckets. 

There are now two main technical issues left to deal with. 

— The splitter construction of [28] was not analyzed as being almost-linear 
time, but rather, only polynomial time. We give a more careful analysis of 
it’s runtime. 

— We need to generalize a component from the construction of [T5], into what 
we call “multi-CFFs”. Roughly speaking, this is a small set of strings of 
length n ■ I that are ‘simultaneously a CFF on each n-bit block’. That is, if 
we think of the string as divided into I blocks of length n, and wish to see 
in each block a certain pattern of weight n in some subset of di indices of 
that block, there will be one string in the multi-CFF that simultaneously 
exhibits all patterns. We construct a small multi-CFF using a combination 
of “dense separating hash functions” and the hitting sets for combinatorial 
rectangles of [22] • See Section |5| for details. 

4 The First Construction 

In this section we give the first construction 

4.1 Preliminary Results for the First Construction 

We begin by giving some definitions and preliminary results that we will need 
for our first construction. The results in this subsection are from |2H] and jSj. 

Let n, q and d be integers. Let be a set of boolean functions / : [(;]'’* —>■ 
{0,1}. Let H he & family of functions h : [n] —>• [gj. We say that H is an 
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{n,T)-restriction family ((n,J^)-RF) if for every {ii,... ,id} C [n], 1 < < 

12 <■■■< id < n and every f G iF there is a function h £ H such that 
f{h{ii),...,h{id)) = 1. 

We say that a construction of an (n, J^)-restriction family H is a, linear time 
construction, if it runs in time 0{\H\ ■ n) = \H\ ■ n ■ poly{\og \H\, logn). 

Let H he a family of functions ft- : [n] —>• [g]. For d < q we say that H is 
an (n, q, d)-perfect hash family ((n, q, d)-PHF) if for every subset S C [n] of size 
[S'! = d there is a hash function h G H such that ft|s is injective (one-to-one) on 
S, i.e., \h{S)\ = d. Obviously, an (n, g,(i)-PHF is an (n, J^)-RF when F = {/}, 
for some / : —>■ {0,1} satisfying /(cti, ..., dd) = 1 iff cti, ..., crd are distinct. 

In [5] Bshouty proved 

Lemma 1. Let q be a power of prime. If q > 4:(d{d — l)/2-\- 1) then there is a 
linear time construction of an (n, q, d)-PHF of size 

log n 
log{q/d^) 

The following is a folklore result 

Lemma 2. Let F be a set of boolean functions f : [g]'^ —^ {0,1}. If there is a 
linear time construction of an {m,F)-RF where m > A[d{d — \)/2 + \) of size s 
then there is a linear time construction of an (n,F)-RF of size 

f sd^iogn \ 

Vlog(m/d2)y ■ 

Proof. Let Hi be an (m, J^)-RF and let H 2 be the (n, m, (i)-PHF constructed in 
Lemma[T] Then it is easy to see that Hi{H 2 ) := {fti(ft 2 ) | ft 2 G II 2 , fti G Hi} is 
an (n, J^)-RF. □ 

Another restriction family that will be used here is splitters [28]. An (n, r, k)- 
splitter is a family of functions H from [n] to [fc] such that for all S C [n] with 
[S'! = r, there is ft € ift that splits S perfectly, i.e., for all j G [k], \h~^{j) 0 S'] G 
{[r/fcj, [r/fc]}. Obviously, an (n, g,d)-PHF is an (n, d, g)-splitter. Define 

u(r,fc):= (3) 

From the union bound it can be shown that there exists an (n, r, fc)-splitter of size 
0{^/ra{r, k) logn), [28]. Naor et. al, [28], use the r-wise independent probability 
space to construct an (m, r, fe)-splitter. They show 

Lemma 3. For k < r, an {m,r,k)-splitter of size 0{y/ra{r, k)logm) can be 
constructed in time 



O (i/r • cr(r, k)mf‘^ log m) . 




When k = uj{,/r), Naor et. al. in [^, constructed an (n, r, fc)-splitter of size 
0{a{r, logn) in polynomial time. We here show that the same construc¬ 

tion can be done in linear time. They first construct an {{rjz)"^jz, fc/z)-splitter 
using Lemma [3] where z = 0(r logfc/(fclog(2r/fc))). They then use Lemma[2]to 
construct an (r^, r/z, fc/z)-splitter. Then compose z pieces of the latter to con¬ 
struct an (r^, r, fc)-splitter and then again use Lemma [3] to construct the final 
In, r, fc)-splitter. 

Note here that we assume that z|fc|r. The result can be extended to any z, k 
and r. 

We now prove 

Lemma 4. For k = w(i/r) and z = 16rlogfc/(fclog(4'r/fc)). An {n,r,k)-splitter 
of size 

r^^^la(r, k) logn = a{r, logn 

can be constructed in time 0{a{r,kY^°^^^ logn). 

Proof. By Lemma [11] in Appendix z is a monotonic decreasing function in 
k and 16\/r > z > 81ogr for i/r < k < r. First we construct an {{rlzY,rfz, 
k /z)-splitter using Lemma |3| By Lemma |3| and Lemma [12] in Appendix [^ this 
takes time 

0{\JVJz ■ ^(r/z, k!z){{r!zYY"^^^ log(r/z)) = o{cr{r, k)). 

By Lemma |3l the size of this splitter is Oi^sjrjz ■ cr(r/z,/c/z) log(r/z)). By 
LemmalU using the above splitter, an (r^, r/z, fc/z)-splitter FI of size 

0{{r jz'f"^a{rjz, kjz) log(r/z) log r) 

can be constructed in linear time. Now, for every choice of 0 = io < *i < 
*2 < • • • < jj,-i < iz = r"^ and /iq, hi,..., h^-i € FI define the function h[j) = 
ht{j) + {k/z)t if it < j < it+i. It is easy to see that this gives an (r^, r, fc)-splitter. 
The splitter can be constructed in linear time and by Lemma [T3| in Appendix 
El its size is 

( l) (ci(^/- 2 )^'^cr(r/z, A:/z) log(r/z) logr)^ = r'^^^(T(r, fc) 

for some constants ci and C 2 . Now by Lemma HI and Lemma [T31 in Appendix [Al 
an (n, r, fc)-splitter can be constructed in time 

0{r^{r^'^^a{r, k)) logn) = k) logn = a{r, logn. 


The following is from [8] 

Lemma 5. There is an {n, (r, s))-CFF of size 

f 2rs 


O 

that can be constructed in linear time. 


V ^ 


logn 


□ 
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4.2 Construction I 

Let r < s be integers and d = r + s. Obviously, 1 < r < d/2 and d/2 < s < d. 
We may also assume that 

r > poly{\ogd) = (4) 

See the table in Section o and the discussion following it. 

We first use Lemma[2]to reduce the problem to constructing a (g, (r, s))-CFF 
for q = 0{d^). We then do the following. Suppose I < h < 12 < ■ ■ ■ < id < q and 
let (Cl, ■ • ■) ■Cd) G {0,1}'^ with r ones (and s zeros) that is supposed to be assigned 
to (ii, ^ 2 , • • • , id)- Let ij-i,..., ij^ be the entries for which Cji,. •., Cir are equal 
to 1. The main idea of the construction is to first deal with entries ij^,... ,ij^ 
that are assigned to one and distribute them equally into k buckets, where k 
will be determined later. This can be done using a (g, r, fc)-splitter. Each bucket 
will contains r/k ones and an unknown number of zeros. We do not know how 
many zeros, say di — (r/fc), fall in bucket i but we know that di + ■ ■ ■ + dk = d. 
That is, bucket i contains di indices of *i,* 2 ,‘‘’ ^id for which r/k oi them are 
ones. We take all possible di + ■ ■ ■ + dk = d and for each bucket i construct 
{q,di — {r/k),r/k)-CFF. Taking all possible functions in each bucket for each 

possible di H- -F dk = d solves the problem. 

Let Hi be an (n, q, d)-PHF such that d^ < q < 2d^ is a power of prime and 
d = r + s. The following follows from Lemma [5] 

Lemma 6. If H is a (g, (r, s))-CFF then {hi{h) \hi F Hi, h G H} is (n, (r, s))- 
CFF of size \H\ ■ \Hi\. 

We now construct a (g, (r, s))-CFF. Let H 2 be a (g, r, fc)-splitter where k < r 
will be determined later. Let H^[d'] and H^[d'] be a (g, d' — [r/k), [r-/fcJ)-CFF 
and (g, d' — {r/k ), {r/k) )-CFF respectively and define H^ld'] := id 3 [d'] U H^[d'] 
where d > d' > |"r/fc]. For every {hi,... ,hk) G H-i[di] x ••• x Hz[dk\ where 
di + • • • + dfc = d and g G H 2 define the function 

Hhi,...,hk,g{i) = hg(^i'j{i). 

We first prove 

Lemma 7. The set of all Hhj^,...,hk,g where {hi ,..., hk) G Hz[di\ x • • • x H^[dk\ 
for some di + ■ ■ ■ + dk = d and g G H 2 is a (g, (r, s))-CFF. 

Proof. Consider any 1 < ii < 12 < • • • < id < g and any (^ 1 ,..., fd) of weight r. 
Let S = {ii,... ,id}- Consider I = {ij | = 1}. Since H 2 is a (g, r, fc)-splitter 

there is g G H 2 such that \g~^{j) fl /| G {\r/k \, {r/k )} for all j = 1,..., k. Let 

dj = \g~^{j) n S'! for j = 1,..., k. Then di + d 2 H- + dk = d. Since H^ldj] is a 

{q,dj — [r/k\, [7’/fcJ)-CFF and {q,dj — [r/k], |’r/fe])-CFF, there is hj G dd 3 [dj] 
such that hj{g~^{j) fl /) = {1} and hj{g~^{j) fl {S\I)) = {0}. 

Now, if = 1 then ii G I. Suppose g{ii) = j. Then G g~^{j) H / and 

Hhu-,hk,g{H) = hj{ii) G hj{g-^{j)r\I) = {!}. 

If = 0 then i^ G S\I. Suppose g{ii) = j. Then i^ G g~^{j) H {S\I) and 

Hh,,...,hk,g{ie) = hj{ii) G hj{g-^{j) n (S'\/)) = {0}. 
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4.3 Size of Construction I 


We now analyze the size of the construction. We will use ci, C 2 ,... for constants 
that are independent of r, s and n. 

Let < q < 2d^ be a power of prime. By Lemma [5] and Lemma [7] the size 
of the construction is 


N:= \Hi\-\H2\- 


y Hsldi] X ■ ■ ■ X H3[dk] 

di-\ - \-dk—d 


where Hi is an (n,q, d)-PHF, H 2 is a (g, r, fc)-splitter and H^ld'] is a {q,d' — 
\r/k], [r/fc])-CFF and {q,d' - [r/fcj, [r/fcj )-CFF. 

Let z = 16r log/c/(/clog(4r/fe)). By Lemma HIT] and 151 we have 


N < Cl ^ ^ fc)(log d) ■ 

logo 

TT d^(‘^di\r/k\\ 




(5) 


di-\ - \-dk—d i—1 




j (2e)’'(logn) 

di~\ - \-dk—d i=l 


j k-\-2 


/ 3i 2 ,x fc/2 X k / ^ \ ‘ 

< (^4^) (2.r(I„g„)(-) (6) 

, (2e,- (0'"” ,og» (7) 


dS]) follows from ([3|) and the fact that (^) < (eajh'f’. (|6]) follows from the 
fact that the number of fc-tuples {di,... ,dk) such that di + ■ ■ ■ + dk = d 
is — cf^id/k)^ for some constant c. ([7]) follows from the fact that 

msKdi+...+dk=dY^i^idi = (d/k)^. 

In summary, we have 




^r^d^log^ 




logn. 
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Now assume r > log^ d (see (HI) and let fc := r/ log log d. 
Since 

16rlogfclogr log^ r log log d 


z log r = 


Cj ~ 

A:log(4r/fc) “ log log log d 


= o(r-), 


12r 12(loglogd)' 


= o(r) 


r^d® log^ d\ 

P ) 


= cs I r 


o(r) 


and djr > 2, we have, 


^ < (cgloglogd)’' 


r(l+o(l)) 


logn. 


This is 


r(l+o(l)) 


logn = N{r, logn 


when 


log^ d < r < 


(log log d)‘^W 


5 The Second Construction 

In the second construction we replace each component H^ldi] x ••• x H^ldk] 
with another construction that is built from scratch and therefore has smaller 
size. The main idea is the following: rather than taking all possible functions 
in each {q,di — (r/fc), r/fc)-CFF in each bucket, we construct what we call a 
“multi-CFF”. We first construct a dense “separating hash family” that maps 
the entries to a smaller domain [g] and separates entries that are supposed to 
be assigned zero from those that are suppose to be assigned one (i.e., they are 
mapped to disjoint sets). This is done in each bucket. We then use the hitting 
set for dense combinatorial rectangles of Linial et. al, [22], to give a separating 
hash family for all the buckets. Then we build a multi-CFF by assigning 0 and 
1 to every possible two disjoint sets. We proceed with the details of the second 
construction. 


5.1 Preliminary Results For the Second Construction 

Let id be a set of functions d : [n] —>■ [g]. We say that id is a (1 — e)-dense 
(n, g, (pi,/92))-Separating Hash Family (SHF) if for every two disjoint subsets 
Si,S 2 C [n] of sizes |S'i| = pi, |S' 2 | = P 2 there are at least (1 — e)|id| hash 
functions d G id such that h{Si) fl d(S' 2 ) = 0. 

The following lemma follows from |8]. 
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Lemma 8 . Let q be a power of prime. If e > 4 (pi /92 + l)/q then there is a 
(1 — e)-dense (n, q, {pi, p 2 ))-SHF of size 

P 1 P 2 logn 

e\og{eq/e{piP 2 + 1)) 
that can be constructed in linear time. 

Let i? C be a set of the form Ri x ... x where Ri C [t]. We say R is 

a combinatorial rectangle with sidewise density 7 , if for every i £ [f], > "f-t. 

A set H C [t]^" is called a hitting set for rectangles with sidewise density 7 if 
for every set i? C [f\^ that is a combinatorial rectangle of sidewise density 7 , 
i?ni? ^ 0 . 

Linial et. al [22] gave the following construction of a hitting set for combina¬ 
torial rectangles. 

Lemma 9. A hitting set for rectangles H C \t\^ with sidewise density 1/3 of 
size \H\ = t^^^^ ■ can be eonstructed in time • 2*^*^^^ 

Let iL be a set of functions h ■. [k] x [n] —?► {0,1}- We say that H is an 

(n, ((pi, 1 , pi, 2 ), • ■ •, (pfc,i, pfc, 2 )))-Multi-CFF (MCFF) if for every k pairs of dis¬ 
joint subsets (5i,i, Si^ 2 ) C [n] of sizes IS'i,!] = pi,i, |S'i, 2 | = pi, 2 , i = 1,... ,k, there 
is h G H such that h{i, Sip) = 1 and h{i, Sip) = 0 for alH = 1,..., fc. 

We now prove 

Lemma 10. There is an {n, {{pip, pip),..., {pkp, Pk, 2 )))-MCFF of size 

( 2 ''(logn) maxpi,ipi, 2 )‘^^^^ /^48p*,ip*,2 

* i=i V 

that can be eonstructed in time n x po/p((maxi pi,ipi, 2 ) 2 ^ logn) 

Proof. We first choose integers qi, i = 1,... ,k that are powers of primes 24pi,ipi,2 < 
qi < 48pi,ipi,2. Since ^{pippip)/qi < 1/2, by Lemma E] there is a 1/2-dense 
(n,gi, (pi,ipi, 2 ))-SHF Hi of size \Hi\ =t = C)((maxi pi,ipi, 2 )(logn)). Let Hi = 
{hip,..., hip}. Let G C [t]^ be a hitting set for rectangles with sidewise density 
1/3 of size IGj = By Lemma El this set can be constructed in time 

fOp) .20ik) = poly {{maxi Pip Pi p)2’^ logn). 

Now for every g € G and every Ri C [qi], of size |i?i| = pi,i, i = 1,... ,k, 
consider the functions /ii,^^,/i 2 ,g 2 ) ■ • ■) define h : [k] x [n] —>■ { 0 , 1 } as 

follows: h{i,j) = 1 iff hi^g.{j) G Ri. 

To show that the set of all such h is an (n, ((pi,i, pi, 2 ), ■ • ■, (pfc,i, Pfc, 2 ))-MCFF, 
consider k pairs of disjoint subsets (Sip, Sip) C [n] of sizes |5'i,i| = pi,i, |S'z, 2 | = 

Pi, 2 , i = I,... ,k. Let H* = [h' G Hi \ h'{Sip)r\h'(Sip) = 0}. Since Hi is a 1/2- 
dense (n,gi, (pi,ipi, 2 ))-SHF, we have |iL*| > \Hi\/2. Since G C [t]^' is a hitting 
set for rectangles with sidewise density 1/3 there is p G G such that hi^g^ G H* 
for all i = 1,... ,fc. Let Ri be any set of size p^i such that hi^g.{Sip) F Ri G 
[qi]\hi,gi{Sip). Then the function h defined above satisfies the following: since 
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hi.QiiSi^i) C i?j we have h(i,Si^i) = 1 and since Ri fl hi^g^(Si^ 2 ) = 0 we have 
h{i, Si^ 2 ) = 0 for all z = 1,..., fc. 

The number of such functions h is 



5.2 Analysis for Construction II 


In the analysis we just replace the size of H^[di\ x • • • x H^[dk\ in the analysis of 
construction I to the new size of a (g, {{di—r/k, r/fc),..., {dk — r/k, r/fc)))-MCFF 
in Lemma [TUI where d^ < q < 2d^ and get 


N < Cl a{r,k) (log d) 

log a 


diA -hrffc —d 

< C[ 

< 


/'c2di\r/k~\ 


0 ( 1 ) k 

s riv irm 

/o \ ^ 

efeV(i2.)iogn ^ 

^ ^ diH- \-dk=di=l 


di~\ - \-dk—d 2=1 

k / k \ 

< C6^dO(l)^0(.)gfeV(12.) ( M log ^ ( n d, 

\k/ V^/ — ^dk=d y'fylj' 

/ \ 7^ I 2 jc 

< (j)" (Ij logn 


Now let r > log^ d and k = r/tp{d) where (p{d) < logd and (p{d) = a;(l). 
Then k = o(r) 




rlog (fi(d) \ , ^ 

= o^y f(d) ) = 


and 


j.O(z) _ ^0((r/fe) log fc/log(2r/fe)) _ 20((^((i) log d/log V3(d)) _ 2 o(r) 


Therefore 


/d\ ’■+“0) 

N = {c 7 Lp{d)Y+°y^y {-\ logn 
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which is 


N{r, s) 


’'(1+0(1)) log „ 


when 


d 


(p((i)‘^(i) 


Since f{d) < logd is any function that satisfies a;(l), the above is true for any 


d 



(logd)‘^(i) 
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A Technical results for the proof of Lemma 0] 

In this appendix we give some proofs of technical results needed for Lemma SI 
Here we assume that r and k are large enough integers 

Lemma 11. Let z = 16r logfc/(fclog(4r/A:)). Then z is a monotonically decreas¬ 
ing function in k in the interval [^/r,r]. In particular, 16^/r > z > Slogr. 
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Proof. From dz/dk\k=x = 0 we get In^x — (ln4r)lna; + ln(4r) = 0. This gives 
two solutions xq, xi for x. One satisfies Inxo > Inr and therefore xq > r and the 
second \n.xi <2 and therefore xi < < y/r. This implies that the function is 

monotone in the interval [y/r,r]. Now since = 16y/r and z\k=r = Slogr 

the result follows. 

We remind the reader that 


Lemma 12. Let z = 16rlogfc/(fclog(4r/fc)) > 8 and k = uj{y/r). 


Proof. First 


Now 


O' ^ = o'(r, < cr(r, k) » . 


[r /r\4»’/2 r /r\ 
tzi-z) 


hr / z 


7 1 4r \ 5^/-^ 

fc log X 


16 log k 

5fc log ^ 
— 16 log k 

5fc 
16 


< a(r, k) 8. 


This with ([81) implies the result. 

Lemma 13. Let z = 16rlog A:/(/i:log(4r/fc)). Then 


r 

z — 




r I = • cr(r, k) 


Proof. First we have 


^ I ) =^{r,k). 


Now 


r 

2 : — 


l)(«0“logllogr) <( 


er 


^A.hz ^ ^7z 


( 8 ) 
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Lemma 14. Let z = 16rlogfc/(/clog(4r/fc)). For r > k = we have 

= cr(r, 


Proof. Let k = y/r ■ (j){r) where (j){r) = a;(l). Then for a constant c there is a 
constant c' such that 


logr“ 


^ , VFlogV 

- ^ (/i(r)log(v^/(/)(r)) 


and there is a constant c" such that 


log(T(r, fc) > c”(j){r)y/r log 


(fir)' 


Now, for some constant c'", 


log ^ _ log^r _ 

logcr(7’, fe) “ f'^ir)\o'i{yfi^lf>ir)) 


0 ( 1 ). 


B Application to parametrized algorithms with relaxed 
disjointness constraints 

In this appendix, for the purpose of deriving Theorems [2] and [U we explain 
how objects related to cover-free families were used by [TH] to obtain certain 
parameterized algortihms. 

Notation. Throughout this appendix, we use the notation Ok to hide k^i^i terms. 
We denote [n] ={1,2,..., n}. For sets A and B,hy {A^ B} we denote the set 
of all functions from A to B. The notation = is used to introduce new objects 
defined by formulas on the right hand side. 

In fact, [H] do not use CFFs directly, but related objects called minimal 
separating families (Definition[2|) that have an additional injectivity property. We 
begin by formally showing that CFFs indeed imply minimal separting families 
of similar size. 


B.l Prom CFFs to minimal separating families 

Hashing families. Recall that, for an integer t > 1, we say that a family of 
functions C { [n] —^ [to] } is a t-perfect hash family, if for every C C [n] of size 
jCj = t there is f £ H. that is injective on T. Alon, Yuster and Zwick [5] used 
a construction of Moni Naor (based on ideas from Naor et al. [^) to hash a 
subset of size t into a world of size t^ using a very small set of functions: 

Theorem 4 ([^ based on Naor). For integers 1 < t < n, a t-perfect hash 
family C { [n] —> [t^] } of size t^i^i ■ logu can be constructed in time ■ 

n ■ log n) 


18 






We will also use the following perfect hash family given by Naor, Schulman 
and Srinivasan [28) . 

Theorem 5 ([28]). For integers 1 < t < n, a t-perfect hash family 'H C { [fc^] —>• 
[t] } of size *) • log k can he constructed in time • fc • logfc). 

Definition 2 (Minimal separating family). A family of functions H C 
{ [n] —>■ [t + 1] } is (t, fc)-minimal separating if for every disjoint subsets C,D F 
[n] with \C\ = t and \D\ < k — t, there is a function h G j-l such that 

- HC) = [t]. 

- h{D) C {t + 1}. 

We show that small cover-free families imply small minimal-separting fami¬ 
lies. 

Lemma 15. Fix any t < k < n. Suppose a (fc^, (t, k — t))-CFF T can be con¬ 
structed in time S. Then a (t, k)-minimal separating family of size Ok{\J-\ ■ • 

logn) Can be constructed in time Ok{S ■ 2^^*^ • logn • n). 

Proof. Fix disjoint subsets C,D F [n] with \C\ = t and \D\ < k — t. It will be 
convenient to present the family by constructing h adaptively given C and D. 
That is, for arbitrarily chosen C and D, we will adaptively construct a function 
h that separates C from D. Function h will be constructed by taking a number 
of choices, where each choice is taken among a number of possibilities. The final 
family TL will comprise all h that can be obtained using any such sequence of 
choices; thus, the product of the numbers of possibilities will limit the size of TL. 
As C and D are taken arbitrarily, it immediately follows that such TL separates 
every pair (C, D). 

1. Let TLo F { [n] —>• [/c^]} be the fc-perfect hash family given by Theorem |4l 
Choose fo G TLo that is injective on C U D — there are • logn choices 
for this stage. 

From now on, we identify C and D with their images in [A:^] under /q. 

2. Note that an element f € T can be viewed as a function / : \kT] —>■ {0,1}. 
Now choose an element /i of the {kf, (t, k — t))-CFF F, with /i(C') = 1 and 
fi{D) =0 — there are |J^| choices for this stage. 

At this stage we have ‘separated’ C from D, and just need to satisfy the 
additional requirement of being injective on C. 

3. Let TL 2 Q { /r^(l) W } be the t-perfect hash family given by Theorem [SJ 

Choose a function /2 € TL 2 that is injective on C — there are *biog k 

choices for this stage. 

The running times and family size are immediate from the construction. 

Plugging in our construction from Theorem [T] to the above we get 

Corollary 1. Fix any t < k < n. A {t,k)-minimal separating family of size 
Okiik/ty^°^'^'^ ■ 2 *^^) . logn) can be constructed in time Ok{{k/tY^°'dl ■ 2 '^b) . 
log n ■ n) 
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Proof. It’s a straightforward plugin of Theorem[T]into Lemma fTSl The only thing 
to notice is that for any t < k, 

N{t, k - < {k ■ iek/tYy+°^^'> = Ofe((fcA)*+°(‘) • 

We proceed to define and construct multiset separators that are smaller than 
those in [TS] . 

B.2 Multiset Separators 

Notation for multisets. Fix integers integers n,r,k > 1. We use [r]o to denote 
{0,..., r}. An r-set is a multiset A where each element of [n] appears at most r 
times. It will be convenient to think of A as a vector in [rjp, where Ai denotes 
the number of times i appears in A. We denote by |A| the number of elements in 
A counting repetitions. That is, |A| = We refer to |A| as the size of A. 

An (r, k)-set is an r-set A G [r]g, where the number of elements with repetitions 
is at most k. That is, |A| < k. For two multisets A,B over [n]. 

Fix r-sets A,B G [rjg. We say that A < B when Ai < Bi for all z e [n]. By 
A G [r]g we denote the “complement” of r-set A, that is, Ai = r — Ai for all 
i G [n]. By A-I-i? we denote the “union” of A and B, that is, (A-|-i?)i = Ai + Bi 
for all i G [n]. Suppose now that A and B are (r, fc)-sets. We say that A and B are 
(r, k)-compatible if A-l-B is also an (r, fc)-set, and |A-|-i?| = k. That is, the total 
number of elements with repetitions in A and B together is k and any specific 
element i G [n] appears in A and B together at most r times. With the notation 
above at hand, we can define the central object needed for the algorithms of [18]. 


Definition 3 (Multiset separator). Let P be a family of r-sets. We say 
that T is an {r., k)-separator if for any {r,k)-sets A, B G [r]g that are {r,k)- 
compatible, there exists F G P such that A < F < B. 

[18] showed that a minimal separating family can be used to construct an 
(r, /c)-separator. 

Theorem 6. / [jg] / Theorem 3.3] Fix integers n,r,k such that 1 < r < k < n, 
and let t = [ 2 fc/rJ. Suppose a {t,k)-minimal separating family TL Q { M 
[t -|- 1] } can be constructed in time /(r, fc, n). Then an (r, k)-separator P of size 
ITLI ■ (r -I- 1)* can be constructed in time Ok{fir,k,t)) ■ {r + I)*)- 

Plugging in our construction of minimal separating families from Corollary 
[T]we get 

Corollary 2. Fix integers n,r,k such that 1 < r < k. Then an (r,k)-separator 
P of size Ofc(r4'=/”+°('=/”)-2‘^('=/’')dogn) can be constructed in time O^. . 

20{k/r) .n.logn) 

The above corollary is an analog of Corollary 3.4 in [18] where the exponent of 
r was 6 fc/r rather than Ak/r oik/r). 
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From this point on we do not give full details, as our theorems follow by a 
direct plug in of Corollary [5] in [TB] as a relpacement for their Corollary 3.4. 

Specfically, using Corollary[2J the algorithm in Corollary 3.8 of [TH] for finding 
a represntative set of a family of multisets V will run in time Ok{\V\ ■ . 

20 (k/r ). Tjiogn) rather than Ok{\'P\ ■ • nlogn) which will translate 

to the running times stated in Theorems [5] and [3] when running the Algorithms 
proving Theorems 5.6 and 5.8 in |18) . 
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